Timing and dynamics of single cell gene expression in the arabinose utilization system 
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The arabinose utilization system of E. coli displays a stochastic "all or nothing" response at 
intermediate levels of arabinose, where the population divides into a fraction catabolizing the sugar 
at a high rate (ON state) and a fraction not utilizing arabinose (OFF state). Here we study this 
decision process in individual cells, focusing on the dynamics of the transition from the OFF to the 
ON state. Using quantitative time-lapse microscopy, we determine the time delay between inducer 
addition and fluorescence onset of a GFP reporter. Through independent characterization of the 
GFP maturation process, we can separate the lag time caused by the reporter from the intrinsic 
activation time of the arabinose system. The resulting distribution of intrinsic time delays scales 
inversely with the external arabinose concentration, and is compatible with a simple stochastic 
model for arabinose uptake. Our findings support the idea that the heterogeneous timing of gene 
induction is causally related to a broad distribution of uptake proteins at the time of sugar addition. 

Key words: time-lapse microscopy; positive feedback; time-delay; quantitative model; cell-to-cell 
variation; GFP maturation. 



Introduction 

Bacteria have sophisticated signal transduction and 
gene regulatory networks for rapid adaptation to envi- 
ronmental changes. In recent years it became increas- 
ingly recognized, that the dynamical response of these 
biochemical reaction networks is subject to significant 
stochastic fluctuations [T], which can lead to heteroge- 
neous behavior across cellular populations. Examples 
include the transient differentiation of B. subtilis in its 
late exponential phase |H |3] , bacterial persistence in E. 
coli [4], and the mating pheromone response pathway in 
yeast [5]. In many of these systems, positive feedback 
plays a fundamental role, since it gives rise to bistability 
and thereby causes two clearly distinct gene expression 
states [6]. It has been demonstrated that biochemical 
noise induces stochastic transitions between the two "sta- 
ble" states, and it was suggested that the resulting pop- 
ulation heterogeneity provides selective advantages for 
colony growth in fluctuating environments 18] . 

A prototypic class of positive feedback systems are the 
inducible sugar utilization systems, in which bistability 
is caused by the autocatalytic positive feedback of the 
sugar on its own uptake proteins. These systems allow 
bacteria to grow on less favorable carbon sources than 
glucose: For instance, in a medium where lactose is the 
only energy source, E. colts lactose utilization (lac) sys- 
tem either imports and catabolizes lactose at a high rate 
(ON state), or it does not use lactose at all (OFF state) 
[9]. This bistable behavior has drastic effects on the be- 
havior at the population level. When a high amount of 
external lactose was added to a previously uninduced cul- 
ture, all cells in the population switched from the OFF 
to the ON state. However, at lower sugar concentrations 
only a fraction of cells switched to the ON state while 



others remained in the OFF state EI] ■ 

Here, we are interested in the dynamics of such a 
switching process on the single cell level. We study these 
dynamics in the context of the arabinose utilization (ara) 
system of E. coli [TT] , another well-characterized bistable 
system, see Fig. [T] In this case, arabinose is imported 
by the high-affinity low-capacity transporter AraFGH 
and the low-affinity high-capacity transporter AraE. If 
internal arabinose exceeds a threshold concentration, it 
activates AraC, which in turn promotes expression of 
araFGH, araE and the genes for arabinose catabolism 
araBAD. Siegele and Hu Q2] analyzed population distri- 
butions at intermediate sugar levels, and revealed that 
the ara system displays an all-or-nothing expression pat- 
tern similar to the lac system. They conjectured that in 
uninduced cells the stochastic background expression of 
the ara regulon leads to a wide distribution of ara up- 
take proteins. Addition of arabinose would then lead to 
different rates of arabinose accumulation, causing hetero- 
geneous timing of gene induction within the population. 
At a given time there would be a fraction of induced 
and a fraction of uninduced cells, and the depletion of 
arabinose by the metabolism of the induced cells could 
explain the fixation of the all-or-nothing response. This 
conjecture is consistent with a computational study of 
autocatalytic expression systems |13j and experiments 
which placed araE under the control of a constitutive 
promoter, finding homogeneous gene expression in the 
population [T4HT6] . The dynamics of switching processes 
has also been studied using flow cytometry techniques, 
which yield a time series of population distributions of 
gene expression levels [T71 HH] . 

In this study we take a different experimental ap- 
proach: Rather than recording population distributions, 
we use quantitative time-lapse fluorescence microscopy to 
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FIG. 1: Regulatory network of the native arabinose utiliza- 
tion system [11] . including the g/p-reporter module used in 
this study. The system consists of genes for arabinose uptake 
(araE, araFGH), genes for arabinose metabolism (araBAD), 
and the regulator AraC. High amounts of intracellular ara- 
binose activate AraC, which stimulates expression from the 
promoters Pbad, Pe and Pfgh- In the absence of arabinose, 
AraC represses expression from Pbad (not depicted). Note 
that we also omitted the less pronounced negative autoregu- 
lation of AraC in the absence and presence of arabinose [43] . 
since this feedback mainly seems to provide a constant tran- 
scription factor abundance [52]. As indicated in light grey, 
in the mutant used in this study the chromosomal araBAD 
operon is deleted, and hence the additional negative feedback 
on internal arabinose is avoided. As a reporter for the expres- 
sion of the ara system we used a plasmid-borne gfp variant 
under the control of the Pbad promoter, see Materials and 
Methods for details. 



follow the expression dynamics of the switching process 
in many cells, individually. In a physics analogy, this is 
akin to following the trajectories of many particles, in- 
stead of recording their spatial density distribution at 
different time points. Clearly the distributions can be 
obtained from the trajectories, but not vice versa, i.e. 
the trajectories contain more information. In the present 
case, this additional information is particularly useful to 
disentangle different variables that affect the response of 
individual cells: The observed time-dependent fluores- 
cence level is the final output of a series of biochemical 
processes, which can be grouped into two connected sub- 
systems, an "uptake module" and a "reporter module" . 
Both modules experience noise, which, to a first approxi- 
mation, can be subsumed into a single parameter for each 
module. As we will see, one can extract these two param- 
eters for each cell by fitting an appropriate model to the 
"fluorescence trajectory" of the cell. As a result, we can 
directly obtain the separate distributions for these two 
parameters, and even measure their correlations. Note 
that this analysis would not have been possible based on 
population distributions of gene expression levels. 

Using this approach, we address the question raised 
by Siegele and Hu [T2], i.e. is the all-or-nothing response 
of the ara system associated with heterogeneous timing 



of gene induction, and, if so, is the heterogeneous tim- 
ing causally related to a wide distribution of ara uptake 
proteins? At subsaturating sugar levels, we observe a sig- 
nificant delay between addition of inducer and increase 
of fluorescence, which is indeed broadly distributed. To 
clarify the origin of this delay and its broad distribution, 
it is necessary to separate the intrinsic lag of the GFP 
expression dynamics from the time-lag inherent to the 
stochastic arabinose uptake. To this end, we leverage 
our microfluidic setup to separately measure the distri- 
bution of GFP maturation times across an E. coli pop- 
ulation. We also record the cell-to-cell variation of the 
growth rates. Using a simple quantitative model for the 
expression dynamics, we then extract the intrinsic tim- 
ing statistics for gene induction. We find that this dis- 
tribution is well described by an analytical delay time 
distribution derived from a stochastic model for the up- 
take module. Our results support the conclusion that the 
heterogeneous timing is indeed due to a wide distribution 
of ara uptake proteins across the population. 



Materials and Methods 

Bacterial strain and plasmid. E. coli strain LMG194 [_F — 
lacX74 galE galK thi rpsL phoA (PvuII) ara714 leur.TnlO] 
|19| was transformed with plasmid pBAD24-GFP (this work) 
using a standard method as described elsewhere 120) . The 
gene gJpmutS 1211 encoding the Green Fluorescent Protein GFP- 
mut3 was amplified by PCR with primers GFP-Kpnl sense (5'- 
TACCATGGTACCAAGTAAAGGAGAAGAACTTTTC-3') and 
GFP-Hindlll antisense (5'-CATAGTAAGCTTTTATTTGTATA- 
GTTCATCCATGCC-3') using plasmid pJBA29 [22] as a tem- 
plate. The DNA-fragment was cut with restriction endonucleases 
Kpnl and Hindlll, and was then ligated into similar treated vector 
pBAD24 [19], resulting in plasmid pBAD24-GFP. The correct in- 
sertion of the fragment was verified by restriction analysis as well 
as by DNA sequence analysis. 

Growth conditions. Cells were grown in LB medium 23 or M63 
minimal medium 19 containing 0.2 % (w/v) glycerol as C-source. 
When indicated, 0.01%, 0.02%, 0.05% or 0.2% (w/v) arabinose was 
added to induce GFP expression. Bacteria were inoculated from 
single colonies grown on LB agar plates and grown overnight (37° C, 
shaking at 300rpm) in M63 medium. Overnight cultures were di- 
luted 1:50 into fresh M63 medium and cultured for 2h. Bacteria 
were subsequently diluted in prewarmed medium to an appropriate 
density and were then applied to one channel of a Poly-L-Lysine 
coated microfluidic chamber (/x-Slide VI, ibidi, Martinsried, Ger- 
many). The slide was then incubated at 37° C for several minutes. 
By softly flushing the channel with prewarmed medium supple- 
mented with the desired arabinose concentration, gene expression 
was induced and the sample was rinsed at the same time. After 
the preparation procedure the vast majority of the bacteria adhered 
with their long axis parallel to the surface. 

Time-lapse microscopy. Time-lapse experiments were per- 
formed on a fully automated inverted microscope (Axiovert 200M, 
Zeiss, Oberkochen, Germany) equipped with a motorized stage 
(Prior Scientific, Cambridge, UK). All devices were controlled by 
Andor IQ software (Andor, Belfast, Northern Ireland). Fluo- 
rescence illumination was provided by an X-citel20 light source 
(EXFO, Quebec, Canada). An appropriate filter set (excitation: 
470/40; beamsplitter 495; emission: 525/50; filter set Nr 38, Zeiss, 
Oberkochen, Germany) was used. Bright field and fluorescence im- 
ages of several fields in one sample were acquired every 5 min with 
a highly sensitive EMCCD camera (iXon DV885, Andor, Belfast, 
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Northern Ireland) through an oil-immersion lOOx plan-neofluar ob- 
jective with NA 1.3 (Zeiss, Oberkochen, Germany), with acquisi- 
tion times of 0.1s to 0.2 s. To further prevent photobleaching and 
photodamage all light sources were shuttered between exposures 
and an orange filter was used in the bright field light path. The 
temperature in the sample environment was maintained at 37° C 
using a custom built heating box. Focalcheck fluorescence micro- 
spheres (Invitrogen, Karlsruhe, Germany) were used to correct for 
output variations of the lamp. 

Data Analysis. ImageJ 1241 and Igor Pro 4.0 (WaveMetrics, 
Lake Oswego, OR) were used for data analysis. Cell outlines were 
created by thresholding the bright-field images. Total fluorescence 
was measured as the sum over all pixel values within the outline in 
the corresponding background corrected fluorescence image. Time 
traces were assembled by tracking the cells manually. As photo- 
bleaching was found to be negligible for the given experimental 
system, fluorescence traces were fitted without further processing. 

Measurement of the GFP maturation time distribution in vivo. 
The maturation time in single cells was determined using an ap- 
proach similar to the one established in Ref. [25] : Translation was 
blocked by the addition of 200 fj,g/ml chloramphenicol, 30 min after 
the induction of g/p-expression with 0.2% arabinose. Fluorescence 
images were acquired every 3 to 5 min before and after inhibition. 
As this measurement was more sensitive the illumination was re- 
duced and the EM Gain of the camera was used. Photobleaching 
could thus again be neglected. Cellular fluorescence was deter- 
mined by summing all pixel values above the background level for 
each bacterium. This method is qualitatively equal to the use of 
cell outlines as described above, but can only be applied if the 
range of fluorescence values is limited and bacteria do not grow 
strongly. The resulting maturation time courses were fitted by an 
exponential function. 



Results 

Single Cell Induction Kinetics 

To study the induction kinetics of the ara system, we 
use an E. coli strain where both araBAD and araC are 
deleted [19] . It is transformed with the reporter plasmid 
pBAD24-GFP, containing the araC gene and the rapidly 
maturing GFP variant gfpmutS |21| which is under the 
control of the Pbad promoter, see Materials and Meth- 
ods. The araC gene is supplied on the plasmid to guar- 
antee full functionality of the DNA loop required for re- 
pression of Pbad in the absence of arabinose [TT] and 
to provide the proper stoichiometry of transcription fac- 
tors and Pbad promoters. The chromosomal deletion 
of araBAD avoids the negative feedback of the internal 
arabinose catabolism. This feedback complicates the sys- 
tem, but is irrelevant for our questions, which focus on 
the kinetics of the induction when arabinose first becomes 
available externally. The gene regulatory circuit of our 
system is illustrated in Fig. [I] 

To perform the time-lapse fluorescence microscopy, we 
introduce the bacteria into a microfluidic chamber, where 
they attach to the Poly-L-Lysine coated chamber wall. 
The microfluidic chamber provides homogeneous external 
conditions for the bacteria and can be used to rapidly ex- 
change the medium. At t=0 min, we induce the bacteria 
with 0.2% (13.3 mM), 0.05% (3.33 mM), 0.02% (1.33 mM) 
or 0.01% (0.66 mM) arabinose, and then record the time- 
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FIG. 2: Examples of single cell induction kinetics of the arabi- 
nose utilization network. Cells were induced at t=0min with 
0.2% arabinose (a) and 0.01% arabinose (b) (empty circles). 
The traces were analyzed up to the first cell division, which 
results in different numbers of data points in the traces. Fits 
of the deterministic gene expression function in Eq. [I] to the 
data are shown as solid lines. The image panels in (a) and 
(b) correspond to the fluorescence traces marked with green 
circles, respectively. The total fluorescence was determined 
within the white outlines created via thresholding of the re- 
spective bright field images and is given in fluorescence units 
(FU). 



evolution of GFP fluorescence in single cells. Representa- 
tive fluorescence "trajectories" for the highest (0.2%) and 
the lowest (0.01%) arabinose concentration are shown in 
Fig. [2] (a) and (b), respectively. 

For all arabinose concentrations, the individual time- 
traces of each cell appear rather smooth and determinis- 
tic, whereas there is a significant variation in the response 
from cell to cell. We also observe a time lag between 
the addition of arabinose and the onset of fluorescence. 
With decreasing arabinose concentration, the typical lag 
time becomes longer, and its cell to cell variation becomes 
more pronounced. Below, we will devise a rigorous way to 
quantify this delay. Here, we only apply a simple thresh- 
olding procedure to extract an apparent lag time. Using 
an intensity threshold of 2.5- 10 4 fluorescence units (FU), 
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we determine an apparent lag time of 16 ±2.5 min at 0.2% 
arabinose and a more substantial delay of 34 ± 10 min at 
0.01% arabinose. In the latter case approximately 10% 
of the bacteria do not show any fluorescence within our 
time window of 70 min. 

With the sudden increase of the external arabinose 
concentration at t=0 min, a cascade of biochemical pro- 
cesses is triggered, culminating in the fluorescent output 
signal measured in our experiment. In order to narrow 
down the origin of the stochasticity in the apparent lag 
time, we need to analyze the individual steps in this cas- 
cade. For this analysis, it is useful to separate the system 
into two distinct modules, an uptake module and a GFP 
expression module, as depicted in Fig. [3] (a). The up- 
take module not only comprises arabinose import (rep- 
resented here by an effective uptake protein "Upt" that 
subsumes transport by AraE and AraFGH), but also in- 
cludes the positive feedback of arabinose on the uptake 
protein. The expression module turns ON the production 
of the output signal, when internal arabinose reaches a 
threshold level [26 . The delay time td that is required 
to reach this threshold is solely determined by the up- 
take module. However, GFP fluorescence does not follow 
promoter activation instantaneously. Instead, the pro- 
cesses of transcription, translation and GFP maturation 
depicted in Fig. [3] (b) also generate a dynamical delay 
and thereby contribute to the apparent delay estimated 
above. To quantitatively estimate the intrinsic delay td 
and its statistics, we now scrutinize the expression mod- 
ule in detail. 



Quantitative characterization of the expression 
module 



GFP maturation time. A significant portion of the 
dynamic delay of the expression module is incurred by 
GFP maturation, the process whereby the folded protein 
becomes fluorescent. The rate-limiting reaction is an oxi- 
dation with a time constant of several minutes up to sev- 
eral hours |27j . depending on the variant of the protein 
and possibly on the organism. However, for our present 
purpose, we not only need the average time constant, but 
also need to know whether there is a large cell-to-cell vari- 
ation associated with the maturation process. With our 
microfluidic setup, we can directly probe this cell-to-cell 
variation experimentally, under the same conditions as 
in the induction experiments. First, we induce bacteria 
with 0.2% arabinose and then inhibit protein synthesis in 
situ by flushing the channel with the antibiotic chloram- 
phenicol. The resulting fluorescence trajectories cease to 
increase about 15 min after the addition of the antibiotic, 
see Fig. [4] (a) for a few representative trajectories. Fol- 
lowing the rationale established in Ref . [35] , this behavior 
reflects the maturation dynamics of the remaining, non- 
fluorescent GFPs. The distribution of time-constants r m 
of GFP maturation shown in Fig. [4 (b) was obtained 
from exponential fits to 77 single-ce 1 timeseries (solid 
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FIG. 3: (a) The arabinose utilization system can be dissected 
into an arabinose uptake module (left) and a gene expression 
module (right). The gene expression module is turned ON, if 
the internal arabinose level exceeds the threshold required for 
activation of the Pbad promoter. The stochastic model for 
the uptake module comprises arabinose import by a heuristic 
uptake protein and the positive feedback of arabinose on the 
synthesis of the uptake protein, see Appendix B for all details. 
The model for the expression module encompasses the pro- 
cesses depicted in (b) and describes the accumulation of total 
fluorescent GFP per cell, see Appendix A for the deterministic 
rate equations. 



lines in Fig. [4] (a)). We find an average maturation time 
of r m = 6.5 min and a standard deviation of 0.6 min, i.e. 
a cell-to-cell variation of only about 10%. 

Our finding of a relatively small cell-to-cell variation 
suggests that the maturation process is largely indepen- 
dent of the internal state of the cell in E. coli. This 
appears plausible, given that the oxidation reaction does 
not depend on intracellular components |27) . For com- 
parison, measurements of the maturation times of YFP 
and CFP in yeast [25 found considerably longer matu- 
ration times of ~ 40 minutes, but only a slightly larger 
relative cell-to-cell variation (15 — 20%). Moreover, from 
in vitro measurements of various YFP variants, oxidation 
timescales as low as 2-8 minutes were determined [35J , in- 
dicating that the rapid maturation time detected in our 
experiment is conceivable in vivo. 

Gene copy number. Since our GFP reporter is en- 
coded on a plasmid, the average copy- number of the plas- 
mid and its cell-to-ccll variation arc important properties 
of the expression module. The plasmid pBAD24 has an 
average copy number comparable to pUC [29] . which is 
present in about 55 copies per cell |30j . Assuming plas- 
mid production and dilution with constant rates, we ex- 
pect Poissonian fluctuations on the order of \/55 ~ 7.5 
plasmids (13 %). In similar plasmids, ColEl and Rl, neg- 
ative feedback is known to reduce the copy-number varia- 
tions below the Poisson limit |3l . This may also apply to 
pBAD24, which would make the variation even less sig- 
nificant. We expect that the plasmid copy number grows 
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FIG. 4: GFP maturation kinetics in single cells. In (a) GFP 
expression was induced with 0.2% arabinose at t=0min and 
protein synthesis was inhibited by addition of 200 fig /ml chlo- 
ramphenicol at t=30mfn, as indicated by the arrow. Expo- 
nential fits to the fluorescent timeseries (solid lines) yield the 
maturation-time distribution in (b). The statistics was ob- 
tained from 77 cells. 



proportional to the volume of the cell, such that the con- 
centration of plasmids remains constant. Hence, we will 
assume that the rate 7 of gene replication in Fig. [3] (b) 
equals the rate of volume expansion of the cells. 

Cell growth. As the above discussion of the gene copy 
number shows, the distribution of growth rates is an- 
other characteristic affecting the quantitative properties 
of the expression module. We analyzed the growth of 
individual cells in the microfluidic channel by recording 
the time-evolution of their area detected under the mi- 
croscope. Since the rod-shaped E. coli cells grow mainly 
along their principal axis (cf. image panels in Fig. [2]), 
the growth rate of the cell area is a proxy for the growth 
rate by cell volume. From exponential fits [33] to 84 
timeseries of the cell area we found a distribution of time 
constants for cell growth with an average of 50 min and a 
standard deviation of 6 min. Hence, the cell-to-cell vari- 
ations of the growth rate is also relatively small. This 
result indicates that the microchemical conditions in our 
channel are sufficiently constant to guarantee a repro- 
ducible growth state of the cells. We also found that 
the doubling time was independent of the arabinose con- 
centration, consistent with the fact that in this strain 
arabinose cannot be catabolized and used as an energy 
source. 

mRNA half-life and protein expression rate. Finally, 
the dynamics of the expression module is dependent on 
the rate constants for gfp expression and mRNA degrada- 
tion. Average mRNA half-lifes were determined for most 



of E. coifs genes [33] and are typically in the range 3 to 
8 min. The work of Smolke et al. [34] indicates that the 
population-averaged half-life of gfp mRNA is in the same 
range; for our analysis below we will assume an average 
half-life of 6 min. In contrast, there is currently no report 
on the cell-to-cell variation of gfp mRNA half-lifes. We 
expect that such a variation would mainly be produced 
by cell-to-cell variations of RNase abundance and other 
components required for transcript turnover. These com- 
ponents have been shown to vary with the growth rate 
[35] , Since the growth rate varies only by ~10% from 
cell to cell in our experiment (see above), we estimate 
the relative cell-to-cell variations of mRNA half-life to be 
similar. This may be an overestimate, since the degrada- 
tion machinery negatively autoregulates its own expres- 
sion [36) . a mechanism known to reduce gene expression 
noise [37] . 

The protein expression rate has been quantified exper- 
imentally at the single-cell level for the Pr promoter of 
phage A, and substantial cell-to-cell variations on the or- 
der of 35% were determined [38] . These large relative dif- 
ferences likely stem from cell-to-cell variations in global 
cellular components such as RNA polymerases or ribo- 
somes. We expect similar variations for GFP expression 
from the Pbad promoter. 

Distribution of GFP expression rate and intrinsic 
delay time 

Given the above characterization of the expression 
module, we can now construct a simple quantitative 
model for its dynamic response, and then use this model 
to extract the intrinsic delay td- The smooth shape of 
the timeseries in Fig. [2] suggests that the dynamics of 
individual cells follows a rather deterministic fate, while 
the differences between the cells stem from cell-to-cell 
variation of the reaction rates. Therefore we use a deter- 
ministic rate equation model to describe the expression 
dynamics within a single cell, but allow for cell-to-cell 
variation in the model parameters. This model follows 
the reaction scheme depicted in Fig. [3] (b) : Transcription 
of gfp mRNA from the promoter Pbad is turned ON at 
t = td and then remains constant at rate a x . However, 
the number of plasmids (and hence gene copies) increases 
with rate 7, which equals the cell doubling rate, so that 
the plasmid copy number P remains stable in the bacte- 
rial population. We denote the mRNA degradation rate 
by \ x , and the translation and maturation rates of GFP 
by a y and r" 1 , respectively (see Appendix A for details). 

Within this model, the time-evolution of the total num- 
ber of fluorescent GFP molecules in a cell, Z(t), is de- 
scribed by the expression 
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FIG. 5: (a) Histograms of the time delay td for varying ex- 
ternal arabinose concentrations, as determined from the fits 
of Eq. [l]to the fluorescence timeseries. The mean (td) as well 
as the standard deviation a TD gradually decrease for increas- 
ing arabinose levels. Note that for 0.01% arabinose approx- 
imately 10% and for 0.02% arabinose approximately 5% of 
the cells did not turn on gene expression within our experi- 
mental time window. Therefore the extraced means and stan- 
dard deviations (see insets) constitute slight underestimates 
in these cases. The solid lines are fits of the analytical delay 
time distributions (Eq. [3J to the data, for details see text. 
The statistics was obtained from 71 cells at 0.01%, 90 cells at 
0.02%, 76 cells at 0.05%, and 101 cells at 0.2% arabinose. (b) 
Cumulative distributions of the delay times rescaled to their 
mean values (td)- A two-sample KS-test indicates that all 
rescaled distributions are likely to be drawn from the same 
underlying probability distribution. The p- values of the indi- 
vidual pairs are 0.25 for (0.01% and 0.02% arabinose), 0.87 
for (0.01% and 0.05% arabinose), 0.54 for (0.01% and 0.2% 
arabinose), 0.90 for (0.02% and 0.05% arabinose), 0.08 for 
(0.02% and 0.2% arabinose), and 0.67 for (0.05% and 0.2% 
arabinose). The analytical prediction is shown for [s, = 3.8, 
b = 30, and to = 2100 min (bold line). 



where r = t — tjj is the time after transcription is 
switched ON, a p = Pa x a y / (7 + A^) is a lumped constant 
giving the protein synthesis rate in fluorescence units per 
minute [FU/min], and Zq is a constant determined by the 
initial conditions. Here, the first two terms in parenthe- 
ses describe transients associated with the equilibration 
of the GFP maturation process and the mRNA degrada- 
tion reaction, respectively, i.e. their contributions decay 
exponentially with time constants r m and A" 1 . In the 
long-time limit the last, exponentially increasing term 
is dominant. It reflects the constant protein production 
from an exponentially growing number of plasmids, and 
describes the long-time behavior of the total fluorescence 
per cell. However, since we study the dynamics of gene 
expression during the first cell cycle after induction, all 
terms, including the transients, are relevant. 

From the previous section, we conclude that the pa- 
rameter a p , comprising the plasmid copy number and the 
protein expression rate, captures most of the cell-to-cell 
variation within the expression module. To fit the model 
in Eq. [T] to the single-cell induction kinetics, we there- 
fore fixed the remaining parameters to their population- 
averaged values. Hence, in the optimization procedure 
of the fit, we only allow the adjustment of a p and the 
uptake-induced delay td, which we seeked to extract. 
Note that this choice fixes all relevant timescales gov- 
erning the dynamics in Eq. [T] and the free parameters 
only impose shifts in the onset (td) and in the absolute 
magnitude (a p ) of g/p-expression. 

We fitted the timeseries of cells induced with various 
levels of arabinose (0.2%, 0.05%, 0.02%, and 0.01%). A 
few representative fitted curves for the highest and low- 
est concentration are plotted in Fig. [2] as solid lines. 
The resulting histograms for the delay time are shown 
in Fig. [5] (a). For the lowest arabinose level (0.01%, up- 
per panel) we find that the delay times are distributed 
between 5 and 50 min with a mean and standard devi- 
ation of (td) = 23 min and o~ Td — 10 min, respectively. 
With increasing arabinose concentration both the mean 
and the standard deviation of the delay time distribu- 
tion decrease gradually, until at the highest arabinose 
level (0.2%, lower panel) a distribution with (td) = 4.1 
min and o~ TD = 2.2 min is reached. 

To test whether there is a relationship between the de- 
lay time and the protein synthesis rate, we calculated 
their cross-correlation coefficients for all inducing arabi- 
nose levels, see Fig. [6] (a). Only in the case of 0.02% 
arabinose a slight anticorrelation was detected, whereas 
for all other concentrations the correlation coefficient is 
close to zero [p- values for finding the observed correlation 
coefficients by chance in an uncorrelated sample: 0.68 
for 0.01% ara, 0.03 for 0.02% ara, 0.73 for 0.05% ara, 
and 0.72 for 0.2% ara]. We also find that the distribu- 
tion of g/p-expression rates itself does not vary systemat- 
ically with the inducing arabinose concentration, and all 
distributions fall on top of each other when rescaled by 
their mean values, see Fig. [6] (b) [pairwise Kolmogorov- 
Smirnov (KS) tests yield significance levels between 0.57 
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and 0.97 for the null hypothesis that the data sets are 
drawn from the same underlying distribution]. In sum- 
mary, the low correlations between tjj and ap on the one 
hand and the independence of ap on the inducing arabi- 
nose level on the other hand, suggest that the uptake and 
the expression module are indeed functionally separate. 
Note that our experimental approach with time-lapse flu- 
orescence microscopy was crucial for these results, which 
would have been impossible to obtain with flow cytome- 
try. 



Stochastic model for the uptake module 

Next, we want to assess whether the extracted delay 
time distributions of Fig. [5] (b) may be causally linked to 
a broad variation in the number of uptake proteins. We 
approach this question with the help of a simple stochas- 
tic model for the "uptake module" depicted in Fig. [3] (a). 
The model is useful in three respects: (i) It serves us to 
illustrate the mechanism whereby stochastic expression 
of the uptake protein genes can produce a broad distri- 
bution of delay times. We will see that according to this 
mechanism, the delay time distributions for different in- 
ducer concentrations should be related by simple linear 
rescaling of the time axis. Thus, we will test for this sig- 
nature of the mechanism in our experimental data, (ii) 
Since most model parameters are strongly constrained by 
literature values, we can test whether an interpretation 
of our data based on the stochastic model is consistent 
with these constraints, (hi) Independent of the precise 
choice of parameter values, which affect the average de- 
lay time and its standard deviation, the model predicts 
a certain shape for the delay time distribution. We will 
test whether this shape is compatible with our data. 

There are two distinct transport systems for arabinosc 
uptake, AraE and AraFGH. However, the two systems 
are coupled, and it was found that arabinose uptake can 
effectively be described as a single Michaelis-Menten pro- 
cess [35] • In the sketch of Fig. [3] (a) , this combined trans- 
port system is represented by a single gene u upt ' . In ad- 
dition to the transport, the uptake module of Fig. [3] (a) 
comprises the activation of AraC by internal arabinosc, 
the subsequent stimulation of transcription by the acti- 
vated complex, and the translation into functional up- 
take protein. Within our stochastic model for the uptake 
module, we describe and simulate all of these processes 
in standard ways, see Appendix B for details. 

Figs. [7] (b) and (c) show the simulated time-evolution 
of the level of uptake proteins and the level of internal 
arabinose upon induction with 0.01% external arabinose 
for a few representative simulation runs. These trajecto- 
ries illustrate the mechanism leading to a broad distribu- 
tion of delay times within our model: Internal arabinosc 
initially accumulates approximately linearly in time, and 
the accumulation accelerates only after reaching the ef- 
fective arabinose threshold of a w 50 \iM for activation 
of the araBAD and upt promoters, which is indicated by 
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FIG. 6: (a) Correlations between delay time and protein 
synthesis rate a p . The scatter plots display small correla- 
tion coefficients R, and the respective p-values for observing 
these correlations by random chance are 0.68 for 0.01%, 0.035 
for 0.02%, 0.73 for 0.05 %, and 0.72 for 0.2% arabinose. (b) 
The cumulative distributions of the protein synthesis rate a p 
were rescaled to their mean values (a p ) to exclude sample- 
to-sample variations of the mean. Importantly, we found no 
correlations between (a p ) and the inducing arabinose concen- 
tration. A two-sample KS-test shows that all rescaled distri- 
butions of a p are compatible with each other. The significance 
levels of the pairwise KS-tests varied between 0.57 and 0.97. 



the black horizontal line in Fig.[7|(c). The time delay, tjj, 
caused by the uptake module is the time required for the 
internal arabinose concentration to reach this threshold 
level. The rate of arabinose import, given by the slope in 
Fig. [7] (c), is proportional to the number of uptake pro- 
teins n in Fig. [7] (b) . If arabinose import is fast compared 
to the timescale of changes in the protein abundance, the 
delay time is given by the simple relation td = o-a/iyon), 
where the arabinose uptake rate per uptake protein, Vq, 
depends on the external arabinose concentration. Thus, 
the distribution of uptake proteins in Fig. [7] (a) directly 
determines the distribution of import rates, which in turn 
are inversely proportional to the delay times, resulting in 
the distribution of delay times shown in Fig. [7] (d). 
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FIG. 7: Illustration of the stochastic arabinose uptake mech- 
anism at 0.01% external arabinose (simulation). The three 
representative time-courses of arabinose uptake proteins in 
(b) and internal arabinose in (c) illustrate that the rate of 
arabinose uptake is proportional to the amount of uptake pro- 
tein present at a given time. Once the internal threshold for 
activation of the promoters Pbad and P up t is reached, the 
positive feedback gets activated and is visible as the kinks in 
(b) and (c). The delay time distribution in (d) (grey bars) is 
obtained by measuring the time to reach this threshold. If 
the uptake proteins decay much slower than the typical time 
required to reach the threshold (adiabatic limit), the delay 
time distribution in (d) can be related to the steady state 
distribution of uptake proteins at zero arabinose in (a). The 
analytical predictions in (a) and (d) (bold lines) axe shown for 
H — 3.8, b = 30, and to = 2100 min (for details see text). 



A simple prediction of this mechanism is that an in- 
crease of the uptake velocity vq will reduce all delay times 
within a distribution of cells by the same factor. In other 
words, the delay time distributions for different arabi- 
nose levels (and hence different vo) should fall on top of 
each other upon simple linear rescaling of the time axis 
(and restoring normalization). In Fig. [5] (b) we test this 
prediction on our experimental time delay distributions. 
We find that after rescaling to the same mean value, the 
cumulative distributions are congruent with each other. 
This agreement is also quantitatively supported by pair- 
wise KS-tests, which test whether the samples are likely 
to be drawn from the same underlying distribution (the 
legend to Fig. [5] shows the respective significance levels) . 
Note that the linear scaling of the time axis with 1/vq 
does not imply linear scaling with the arabinose level, 
since vq depends nonlinearly on the external arabinose 



level, see also further below. 

In order to relate the experimentally observed shape of 
the distribution to the prediction of the stochastic model, 
we will now derive an analytical expression for the delay 
time distribution. To this end, we first consider only in- 
trinsic noise and study the effect of extrinsic noise below. 
Before the addition of the inducer arabinose, expression 
of the uptake proteins is a completely random, unregu- 
lated process. Following the work of Berg [40] and under 
the assumptions stated in Appendix B, we find a steady- 
state distribution P(n) for the number of uptake proteins 
n of the form 



P(n) 



l + b 



n 



(2) 



which is sometimes referred to as a 'negative binomial'. 
Here, the ratio b = v p /\ m of the translation rate and 
the mRNA degradation rate corresponds to the typical 
number of proteins produced from a single mRNA and is 
also known as the "burst size" [31]. The ratio /i = v^/Xp 
of the basal transcription rate and the protein dilution 
rate can be interpreted as a dimensionless "burst fre- 
quency" (the number of bursts within the lifetime of a 
protein). Both parameters determine the mean (n) = fib 
and the variance Sn 2 — (n)(l + b) of P(n). Fig. [7] (a) 
shows the steady-state distribution P(n) obtained from 
our stochastic simulations of the uptake module (grey his- 
togram) together with the analytical expression in Eq. [2] 
for the same rate constants. The excellent agreement 
suggests that the assumptions leading to Eq. [2] are all 
satisfied in the relevant parameter regime. 

Next, we study the effect of extrinsic noise which leads 
to a variation of reaction parameters from cell to cell. 
An experimental characterization of extrinsic noise in E. 
coli |38j found a typical parameter variation of ~ 20%. 
When we adopt this level of extrinsic noise for all param- 
eters in our stochastic simulations, the resulting protein 
distribution has a significantly larger standard deviation 
than the distribution in the absence of extrinsic noise, 
while the mean remains almost unchanged, see Suppl. 
Fig. SI. However, the protein distribution in the pres- 
ence of extrinsic noise is still well fitted by Eq. [2] with 
an increased effective burst size and a reduced effective 
burst frequency. Keeping this in mind, the following re- 
sults can be generalized to the realistic scenario where 
extrinsic fluctuations are present. 

To obtain an approximation for the delay time distri- 
bution, we assume that arabinose uptake is rapid com- 
pared to the typical timescale of changes in the protein 
abundance. In this adiabatic limit, the delay time is in- 
versely proportional to the current protein abundance in 
each cell, i.e. tjj = tq/u, where tq = clq/vq is the time 
for a single uptake protein to accumulate arabinose to the 
threshold level a . With this relation, the steady-state 
uptake protein distribution (Eq. [2]) leads to a delay time 
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FIG. 8: Estimated parameters as a function of external ara- 
binose, as obtained from fits of the delay time distributions 
in Fig. [5] (a). The timescale of arabinose accumulation to /fa 
in (a) decreases monotonically with the inducing arabinose 
concentration, as expected from saturating Michaelis-Menten 
kinetics of the uptake proteins, cf. the Lineweaver-Burk plot 
(inset) for the scaling with the inverse arabinose concentra- 
tion. In contrast, the burst frequency in (b) is constant 
for all arabinose levels. This is consistent with our central 
assumption that the underlying uptake protein distribution 
responsible for the heterogeneous timing is independent of 
the inducing arabinose concentration. 



distribution of the form 



Q(td) 



T D 



TO /-TO 



r(r /r D + /i) 



T(t /t d + l)T(p) 

where T(x) is the Gamma function. In Fig.[7](d) we com- 
pare this analytical prediction (black line) to the stochas- 
tic simulation (grey bars). The small deviation stems 
from the fact that the number of uptake proteins is not 
constant over the period of the time delay*. The mean 
and variance of the delay time distribution can be ap- 
proximated by 



v~d) 



St* 



(n) 



Sn 2 



T0_ 
jib 

: 

i' 



(4) 



see Appendix B. From these expressions it is clear that 
the model has two key parameters, which together de- 
termine the mean and width of the delay time distribu- 
tion: the time required to reach the internal arabinose 
threshold by a single protein burst, to/6, and the burst 
frequency fi. 

Now we test whether the shape of the delay time dis- 
tribution predicted by the model is quantitatively con- 
sistent with our experimental distributions. To this end, 
we fit the model in Eq. ^ to the data in Fig. [5] (a) by 
varying the two key parameters identified above. The re- 
sulting fits (solid lines) display good agreement with the 
experimental data, as indicated by one-sample KS-tests 
under the null hypothesis that the samples are drawn 
from the analytical distribution. The significance levels 
are 0.50, 0.47, 0.77, and 0.07 for 0.01%, 0.02%, 0.05%, 
and 0.2% arabinose, respectively. Only in the case of 

0. 2% arabinose the test points to a significant difference 
between the theoretical and experimental distribution. 
However, for this concentration the estimated delay times 
are very short, such that the error of the estimation it- 
self is likely to account for the deviations. Note that the 
two-parameter fit guarantees that the mean and stan- 
dard deviation of the experimental and theoretical dis- 
tribution will match. However, the fact that the shape of 
the distributions show excellent agreement is a nontriv- 
ial result, suggesting that the discussed delay mechanism 
can indeed explain our observations. 

Finally, we address the consistency of the parame- 
ter values. Fig. [8] shows the estimated parameters as 
a function of the external arabinose concentration: The 
timescale To/b of arabinose accumulation in (a) decreases 
monotonically as a function of external arabinose and 
saturates for large sugar abundances, whereas the burst 
frequency \x in (b) is constant for all arabinose levels. 
This observation is consistent with the idea that the un- 
derlying protein distribution, characterized by /i and 6, is 
independent of the externally provided sugar concentra- 
tion, and that the differences in timing can be explained 
by shifts in the effective arabinose uptake velocity per up- 
take protein, vq: By assuming simple Michaelis-Menten 
saturation kinetics for vq, one expects that To scales in- 
versely with the external arabinose concentration [a ex ], 

1. e. t = ao/v m ax (1 + K m /[a ex ]), where v max denotes 
the maximal uptake velocity per uptake protein and K m 
the Michaelis constant. This behavior is indeed found in 
Fig. [8] (a) (inset) and with the resulting values for v max , 
K m and a typical value of b = 30 for the burst factor 
[41] . all parameters are compatible with the experimen- 
tally constrained ranges discussed in Appendix B. 



* Indeed, if the protein dynamics is much faster than the character- 
istic time of arabinose uptake (Xp 1 <C tjj), every cell experiences 
simply the average abundance of uptake protein (n) and the delay 
time distribution approaches a sharply peaked function around 
To = To(n)~ 1 (data not shown). In our case, Xp 1 £3 70min is 
much larger than the average delay times, so that the assumption 
of a constant n is sufficiently accurate. 



Discussion 

We studied the expression dynamics during induction 
of the bistable arabinose utilization system in single E. 
coli cells using quantitative time-lapse fluorescence mi- 
croscopy. Upon addition of arabinose, we observed a 
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characteristic time delay before the cells switched from a 
state of basal expression to a state of high expression of 
the ara regulon. This typical duration of the delay ex- 
hibited a systematic dependence on the externally sup- 
plied arabinose concentration: At a saturating arabinosc 
level, we found rapid induction within all cells of the cul- 
ture, whereas with decreasing levels, we detected a signif- 
icant broadening and shift of the delay time distribution 
function. In order to characterize the cell-to-cell vari- 
ability in the cellular response, we dissected the system 
into an uptake module with stochastic behavior, and an 
expression module which displays virtually deterministic 
behavior in individual cells. We first studied the expres- 
sion module, in particular by measuring the cell-to-cell 
distribution of the GFP maturation time. To the best 
of our knowledge, this constitutes the first measurement 
of a maturation time distribution in bacteria. We then 
developed a hybrid deterministic/stochastic theoretical 
model to analyze our experimental data. The model is 
based on the assumption that the initial basal expres- 
sion of the arabinose transporters determines the rate of 
arabinose uptake. Adopting the approach of Berg, we 
find an analytic expression for the distribution of trans- 
porter proteins and the distribution of delay times. The 
theory consistently fits the shape of the experimental de- 
lay time distributions for various inducer concentrations. 
Hence our data support a previous conjecture by Siegele 
and Hu [T2], according to which the delay time distri- 
bution is causally linked to the distribution of uptake 
proteins in the absence of the inducer. To corroborate 
our model even further it would be interesting to control 
the level of transporter proteins independently, e.g. by 
using an inducible promoter that is independent of ara- 
binose. Also, it remains an open question how the two 
transport systems are coupled. It appears that the high- 
affinity low-capacity transporter araFGH and the low- 
affinity high-capacity transporter araE are orchestrated 
to respond like a single protein. A similar analysis to 
ours using knockout mutants in one of the two transport 
systems could shed light on this matter. 

In general, we determined the dynamic response of bac- 
teria to an external change of food conditions. Since such 
decisions are of vital importance to living systems, we can 
speculate about their impact on the fitness of a bacterial 
population. The observed heterogeneous timing in gene 
induction may simply be a fortuitous consequence of the 
evolutionary process that shaped the arabinosc utiliza- 
tion system in E. coli. Alternatively, it may be beneficial 
for a bacterial colony, if the individual cells respond at 
different times when arabinose suddenly becomes avail- 
able in modest amounts. Note that in our experiments 
with the araBAD deficient strain, even the lowest ara- 
binose level, if maintained over a long time, ultimately 
induces the ara system in almost all cells. However, for 
a wild-type strain in an environment where arabinosc 
availability may fluctuate, temporal disorder of gene in- 
duction could provide selective advantages for the colony 
as a whole. For instance, it might be beneficial to pre- 



vent costly synthesis of the arabinose system in all cells 
when the sugar level is only moderate and may soon be 
depleted. Our analysis indicates that the delay time dis- 
tribution of the system can be readily tuned over evolu- 
tionary timescales, by adjusting the burst frequency and 
burst size of the uptake proteins. In the future, it will 
be interesting to further explore the possible connections 
between the system design in individual cells and the bi- 
ological function at the population level. 

Appendix 

Appendix A: Deterministic GFP expression model 

To extract the intrinsic time delay Trj from our single cell expres- 
sion data, we employ a simple deterministic model that follows the 
scheme depicted in Fig. [3] (b). We assume that the transcription 
rate from the promoter Pbad is zero until the internal arabinosc 
threshold for activation of Pbad is reached at t = rrj. Then, the 
promoter activity jumps to its maximal value a x . The correspond- 
ing rate-equations for the total abundance of plasmids (P), gfp 
mRNA (X), immature GFP protein (Y), and mature GFP protein 
(Z) per cell, are 

d t P = ~/P 
d t X = a x P-\ x X 
dtY = ayX-r^Y 
d t Z = r~ l Y. 

with the cell-doubling rate 7 and the rate for transcription a x , 
translation ct y , maturation r^ 1 , and mRNA degradation A^. Note 
that the model does not include dilution due to cell growth, since 
we measured the total fluorescence per cell in our experiments. 
Therefore the number of plasmids (number of gene copies) increases 
exponentially in time, keeping the number of genes per volume 
constant. Solving these equations for Z(t) leads to Eq. [l]in the 
main text. 

Appendix B: Stochastic model for arabinose uptake 

The arabinose uptake module, see Fig. [3] (a), includes the pro- 
cesses for the uptake of arabinose as well as transcription, transla- 
tion, and turnover of uptake proteins. In the following we describe 
the chemical reactions included in the stochastic simulations used 
to generate Fig. [7] and Suppl. Fig. SI. We then derive an analyt- 
ical approximation for the delay time distribution and discuss the 
experimental constraints on the model parameters. 

Arabinose uptake. Comparison of arabinose uptake in wildtype 
strains with araE and araFGH deletion strains revealed that the 
two transporters do not operate independently |39| . Instead, ara- 
binose transport was best described by a single Michaelis-Menten 
function. Our model reflects this behavior of the wildtype strain 
through the use of a single "effective" uptake protein (referred to 
as Upt) for arabinose import, 

a ex + Upt 4^ a e x ■ Upt 
a ex ■ Upt v 2H2f a _|_ TJpt . 

The uptake protein binds external arabinose a ex with dissociation 
constant K m and, once bound, translocates it to the cytoplasm at 
rate v m a X ■ The effective uptake velocity per uptake protein is hence 
Vo = v max [a ex ]/(K m + [a ea: ]). Cytoplasmic arabinose is denoted 
by a. 
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Transcriptional regulation. The Pbad promoter in the ara- 
regulon is one of the best characterized bacterial promoters: In 
the presence of internal arabinose, AraC stimulates transcription 
from Pbad j while AraC represses transcription by formation of a 
DNA loop in the absence of arabinose jll] . When exceeding an 
arabinose threshold of ag fti 50 /iM, the promoter activity of Pbad 
increases cubically with the internal arabinose concentration [26| . 
In contrast to the detailed studies on Pbad > less i s known about the 
promoter activity function of the promoters Pb and Pfgh, which 
regulate expression of the transport proteins. Both promoters are 
also induced by internal arabinose, but lack an upstream AraC- 
binding site (required for DNA looping) and are not repressed in 
the absence of arabinose. Consequently, their basal expression level 
is higher than for Pbad and the fold-change is reduced from ~400 
for Pbad to ~150 for Pg and Pfgh 1421 . However, the detailed 
promoter activity as a function of internal arabinose is not known 
for these promoters. Apart from the lack of the AraC binding site 
required for DNA looping, the promoters Pg and Pfgh display a 
high similarity to Pbad 1431 . Therefore we model transcriptional 
regulation of the uptake proteins by introducing a heuristic pro- 
moter P up t, that has the same characteristics as Pbad, but lacks 
the repression in the absence of arabinose. To reproduce the cubic 
increase of the promoter activity function of Pbad we allow three 
arabinose molecules to bind AraC with dissociation constant Kq. 
This activated complex binds the promoter P up t with dissociation 
constant Kp and thereby switches the transcription rate from its 
basal rate v m to its maximal rate v m . The chemical reactions for 
transcriptional regulation are 

3a + C ^ 

(13 ■ C + P up t 



Hp 



a 3 -C 



Pupt 
Pupt 



a 3 -C 
a 3 -C- 

Pupt H" 
a 3 ■ C ■ P, 



upt 



upt 



Here the concentration of AraC molecules [C] is a variable that 
changes little over time |43l and is therefore assumed to be a 
constant parameter in our model. In steady state, the proba- 
bility for finding the promoter P up t in a transcriptionally acti- 



upt 

function of the internal arabinose concen- 
= [a] 3 / (K C K P /[C] + [a] 3 ). We define the 



* upt 



vated state is a Hill 
tration, [a 3 ■ C ■ P up t] 

effective arabinose threshold for activation of P,,,-nt as K„ 
(KcKp/IC]) 1 ' 3 . 

Translation & turnover. mRNA is translated into functional 
uptake protein at rate v v and gets degraded at rate A m . In contrast, 
the uptake proteins and arabinose are only diluted by cell growth 
at doubling rate 7: 



Upt 



Upt 



Delay time distribution. Following Berg |40| . we derive an ana- 
lytical approximation for the delay time distribution of our stochas- 
tic model. In the absence of arabinose, transcription of the gene 
for the uptake protein takes place at its basal rate v m - Neglect- 
ing operator state fluctuations 1441 . the probability to observe m 
transcription events up to time t follows a Poisson distribution 

m! 

with mean and variance v®, t. In the limit of short mRNA life- 

— 1 — 1 

time X m compared to the protein lifetime X p , one can assume 

instantaneous, geometrically distributed protein bursts from each 
mRNA molecule. This implies that the probability that m mRNA 
molecules produce n proteins follows a negative binomial distribu- 
tion 

M™ ( 6 X ( » +n - 1 ~ 
i+b) \i+b) 



where the burst size b = u p /X m is the average number of pro- 
teins produced from one mRNA molecule. Hence, the probability 
to produce n proteins up to time t is the weighed sum of nega- 
tive binomials P(n\u m t, b) = ~^2 m 'P{rn\v^ n t) ■ AfB(n\m,b). Setting 
t equal to the protein lifetime Xp 1 yields the steady state distri- 
bution of proteins, and for large fi = f m /A p we can replace the 
Poisson distribution by a delta function located at m = fx, lead- 
ing to Eq. [2] in the main text. Applying the transformation rule 

Q( t d) = I dn ^^ ' j P( n ) yields the delay time distribution in Eq. jsj 

and the moments (tb) and 6t^ = (t^) — ( t d)' 2 are determined by 
the integrals 

(td) = J dr D T D Q(T D ) = J dn^-P(n) , and 

(r 2 D ) = J dr D r 2 D Q(r D ) = j dn^P(n). 

Here expansion of the integrands up to second order in Sn = n— (n) 
brings us to Eqs.|4] 



AfB(n\m,b) 



Parameter values. The effective arabinose threshold K up t ~ 
50 /iM and the promoter binding constant Kp = 10 nM are chosen 
similar to the parameters of Pbad 1261 1451 . This choice determines 
the ratio Kg/[C] = K upt /Kp (see above) and by choosing a typical 
value of [C] = 100 nM we obtain K c = 10 6 /xM 3 . For the maxi- 
mal promoter activity we set a typical value for the promoters in 
the ara- regulon, v m = 5 mRNA/min, which was derived from the 
mRNA steady state levels reported in 1431 . With a promoter fold- 
change of 150 similar to Pe and Pfgh 33 ■ the basal transcription 
rate is expected to be on the order of u m 0.03 mRNA/min . From 
our fits of Eq. [3] to the experimental delay time distributions we 
obtained an average value of [i = v m l^v = 3.8. With our protein 
dilution rate of X p = 7 = ln(2)/(50 min) (from our measurement 
of the growth rate, see main text), this yields a basal expression 
rate of v m RJ 0.05 mRNA/min - in good agreement with the bio- 
chemical constraints stated before. The mRNA degradation rate 
A m is set according to a half-life of 2 min 1431 . allowing us to adjust 
the translation rate v v to match a typical burst factor of b = 30 
1411 . The Km for arabinose uptake is in wildtype cells about 50 
|39| . and the maximal uptake rate per uptake protein, v max , can 
be estimated from bulk measurements in which the uptake rate 
per total cellular dry mass was determined 39 . By assuming a 
dry mass of 3 ■ 10 — 13 g per cell and about 10 3 — 10 4 uptake 
proteins per cell 1471 . we end up with Vmax = 200 — 2000 arabinose 
molecules/protcin/min. From a Lineweaver-Burk fit to the data 
in Fig. [8] (b) we obtained v max RJ 120 molecules/protein/min and 
K m = 2.8 mM. While the value for v max is compatible with the 
biochemical constraints, our Km differs by two orders of magnitude 
from the previously reported value of 50 fiM |39| . For such a small 
Michaelis constant, all arabinose concentrations used in our ex- 
periments would saturate the uptake system completely and hence 
there should be no difference in timing of gene induction. However, 
the experimental conditions of Ref. |39l differ from ours; in partic- 
ular, the proton gradient between periplasm and cytoplasm, which 
drives the arabinose/P + symport by AraE, is limited by oxygen 
availability |48| . For the case of the lactose/P+ symporter LacY, 
it has been shown that a reduced proton gradient leads to an in- 
crease of the apparent K m ^49^. Hence, oxygen limitation in our 
micofluidic setup could explain the observed discrepancy. 

Stochastic simulations. Although in the rate equations above 
only the equilibrium constants are depicted, we took for the dy- 
namical simulations all association- and dissociation processes ex- 
plicitly into account. As a conservative assumption, all associa- 
tion rates were chosen 10-fold smaller than the diffusion-limited 
on-rate of 2nM _1 min for a typical transcription factor in E. 
coli |50| and the dissociation rates were adjusted according to the 
respective equilibrium constant. The trajectories in Fig. [7] (b) and 
(c) correspond to single kinetic Monte-Carlo simulations |51| for 
0.01% external arabinose. The protein and delay-time distribu- 
tions in Fig. [7] (a) and (d) (solid lines) were obtained from 5 ■ 10 4 
independent simulation runs with the same parameters. 
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